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On  Selecting  the  Best  of  k  Lognormal  Distributions 


Shanti  S.  Gupta  and  Klaus  J.  Miescke 

Purdue  University  University  of  Illinois 

at  Chicago 

Abstract  r , 

■is 

For  k  lognormal  populations,  which  differ  only  in  one  cqctam  parameter  #,  the  problem 
of  finding  the  population  with  the  largest  value  of  0  is  considered.  For  two-parameter 
lognormal  families,  several  natural  choices  of  9  are  treated,  where  the  problem  can  be 
solved,  through  logarithmic  transformation  of  the  observations,  within  the  framework  of 
estimating  parameters  in  k,  possibly  restricted,  normal  populations.  For  three-parameter 
lognormal  families,  this  standard  approach  of  selecting  in  terms  of  natural  estimators  fails 
to  work  if  9  is  the  “guaranteed  lifetime."  For  this  case,  a  selection  procedure  is  derived 
which  is  based  on  an  L-statistic  which  has  the  smallest  asymptotic  variance.  Of  importance 
here  is  that  it  is  location  equivariant,  whereas  it  does  not  matter  what  it  actually  estimates. 
Comparisons  are  made  with  other  suitable  selection  rules,  through  the  asymptotic  relative 
efficiencies,  as  well  as  in  an  example  of  intermediate  sample  sizes.  In  the  latter,  it  is  seen 
that  the  selection  rule,  which  is  based  on  the  sample  minima,  compares  favorably.  > 

!  V  ’’■•/•v-  J  ^ -r.>,  - .  ' — 

AMS  198i0  Subject  Classification:  Primary  62F07  Secondary  62FI2 


Key  Words:  Selection  procedures,  the  best  lognormal  distribution,  L-estimators,  finding 
the  most  reliable  component. 


1.  Introduction 


By  definition,  we  say  that  a  random  variable  X  has  a  three-parameter  lognormal 
distribution  if  ln[X  —  7)  is  normally  distributed  with  mean  n  €  R  and  variance  a2  >  0, 
where  0<7<X<oois  called  the  “guaranteed  lifetime.”  This  distribution  reduces  to 
the  so-called  two-parameter  lognormal  distribution  if  7  is  known,  in  which  case  7  is  set 
equal  to  zero  for  simplicity. 

The  lognormal  family  has  been  studied  and  discussed  thoroughly  in  Aitchison  and 
Brown  (1960)  and  in  Johnson  and  Kotz  (1970).  The  latter  authors  state  that,  “It  is  quite 
likely  that  the  lognormal  distribution  will  be  one  of  the  most  widely  applied  distributions 
in  practical  statistical  work  in  the  near  future.”  Various  applications  have  been  considered 
in  the  literature,  as  is  reported  by  Kane  (1982).  The  main  part  of  interest  has  been  the 
estimation  of  parameters. 

A  special  feature  of  this  distribution  in  reliability  applications,  where  X  represents 
the  timelength  of  life  (or  repair)  of  a  piece  of  equipment,  is  that  the  failure  rate  (or  repair 
rate)  increases  at  first  and  then  eventually  decreases  to  zero,  cf.  Gupta,  McDonald,  and 
Galarneau  (1974). 

Estimation  for  the  two-parameter  family  appears,  at  a  first  glance,  to  be  quite  easy, 
since  the  logarithmic  transformation  of  observations  in  a  given  random  sample  reduces  the 
model  to  the  familiar  normal  family  N(n,o2).  Especially,  maximum  likelihood  estimation, 
which  is  invariant  under  transformations,  apparently  is  straightforward.  However,  estima¬ 
tion  of  parameters  other  than  n  and  a2,  under  different  criteria  of  estimation,  may  lead  to 
substantial  difficulties,  as  it  is  pointed  out  in  Johnson  and  Kotz  (1970).  A  profound  anal¬ 
ysis,  which  includes  the  Bayesian  approach,  is  provided  by  Zeilner  (1971).  Further  Bayes 
estimators  and  illustrative  examples  can  be  found  in  Marts  and  Waller  (1982),  Chapter 
9.3. 

Estimation  for  the  three-parameter  family  turns  out  to  be  much  more  complex.  The 
concept  of  sufficiency  does  no  longer  reduce  a  sample  to  a  two-dimensional  sufficient  statis¬ 
tic,  but  merely  ends  up  with  the  order  statistics.  Even  the  maximum  likelihood  approach 


runs  into  serious  difficulties  since  the  maximised  likelihood  tends  to  infinity  as  7  tends  to 
the  sample  minimum,  cf.  Johnson  and  Kots  (1970).  Methods  to  circumvent  this  difficulty 
as  well  as  other  methods  of  estimation  are  considered  by  Cohen  and  Whitten  (1980),  John¬ 
son  and  Kots  (1970),  Kane  (1982),  and  LaRiccia  and  Kindermann  (1983).  Furthermore, 
estimators  for  the  reduced  model,  where  a2  =  1,  are  discussed  in  Gibbons  and  McDonald 
(1975). 

Let  us  assume  that  there  are  k  lognormal  populations  7,  with  parameters  (7,,  /i,,  a2), 
*  =  1  ,...,fc,  from  which  independent  samples  Xt  of  a  common  sise  n  >  2  have  been 
drawn,  i  =  1, . . . ,  k.  Assume  that  these  populations  differ  only  in  one  single  parameter  0, 
say,  which  may  be  i,n,o2,  or  any  other  parameter  of  relevance  to  a  lognormal  distribution. 
The  problem  considered  in  this  paper  is  to  derive  some  suitable  decision  rules,  based  on 
the  k  samples,  to  find  that  population  which  is  associated  with  the  largest  0-value. 

The  standard  approach  to  this  problem  is  to  compare  populations  7j, . . .  ,7*  through 

*  »  A 

estimators  0<  =  0<(2£i,.  •  *  =  l.-i*.  where  0,  is  an  estimator  of  0,,  for  the  k 

samples  of  sise  n,  which  is  optimum  in  some  reasonable  way.  This  will  be  done  in  Section  2. 
where  the  two-parameter  lognormal  family  case  is  treated.  For  various  choices  of  0,  it  will 
be  shown  that  through  logarithmic  transformation  of  observations,  the  problem  reduces  to 
the  selection  of  normal  parameters  which  has  been  dealt  with  extensively  in  the  literature, 
cf.  Gupta  and  Panchapakesan  (1979)  for  an  overview.  Furthermore,  special  emphasis 
will  be  given  to  the  case  where  0  is  the  maximum  failure  (or  repair)  rate  because  of  its 
importance  to  reliability  applications. 

Section  3  deals  with  the  problem  of  0  =  ^  for  three-parameter  lognormal  families, 
where  we  assume  that  Mi  =  . . .  =  m*  and  a2  =  . . .  =  a2.  Because  of  the  difficulties  arising 
in  an  attempt  to  estimate  7,  it  seems  natural  to  choose  any  other  point  of  location  in  the 
three- parameter  lognormal  density  as  0,  which  can  be  estimated  in  an  easier  and  better 
way.  Thinking  perhaps  first  of  the  mean,  median,  mode,  or  some  fixed  quantile,  one  arrives 
eventually  at  the  following  nonstandard  approach  to  the  present  problem,  where  all  points 
of  location  are  treated  equal,  without  regard  to  their  actual  meaning  to  the  shape  of  the 
density.  From  the  theory  of  L-estimators,  which  is  thoroughly  developed  and  discussed 


in  Huber  (1981)  and  in  Lehmann  (1983),  a  minimum  variance  L-estimator  is  derived  for 
the  three* parameter  lognormal  family,  and  then  it  is  used  for  selecting  the  population 
with  the  largest  “guaranteed  lifetime”  What  is  actually  done  is  to  find  a  location- 
equivariant  statistic  which  has  a  small  variance.  Here,  it  does  not  matter  which  point  of 
location  is  actually  estimated,  and  it  will  be  seen  that  the  theory  of  L-estimators  can  be 
successfully  applied  to  the  present  situation  where  nonsymmetric  densities  are  considered. 
„  uc  performance  of  this  selection  procedure  is  also  examined  in  an  intermediate  sample 
size  example  of  n  =  20,  and  it  is  shown  that  for  this  case,  the  natural  selection  rule,  which 
selects  in  terms  of  the  largest  of  the  k  sample  minima,  compares  favorably. 

2.  Two- Parameter  Lognormal  Populations 

At  the  beginning,  let  us  briefly  recall  some  basic  properties  of  a  two-parameter  lognor¬ 
mal  random  variable  X.  ln(X)  ~  N(p,a *),  i.e.  the  logarithm  of  X  is  normally  distributed 
with  mean  p  €  R  and  variance  a2  >  0.  The  density  and  the  cumulative  distribution 
function  of  X  are  given  by 

(1)  /(*)  =  {ax)~lip(o~l(lnx  -  #*)),  F(x)  =  •(o~I(inx  -  p)),  x  >  0, 

where  <p  denotes  the  density  and  9  the  cumulative  distribution  function  of  the  standard 
normal  distribution  N{0, 1). 

From  the  well-known  identity  for  the  moments  of  X,  cf.  Gupta  (1962),  E(X *)  = 
exp(qp  +  q2a2/ 2),q  =  1,2,...,  we  get  the  following  relations  between  (m, v2),  say,  the 
mean  and  variance  of  X,  and  (p,a2). 

(2)  m  =  exp(p  +  o2/ 2),  v 2  =  exp(2p  +  a2)\exp(o2)  -  l|, 
and 

(3)  p  -  Jn(mj|va  +  m3j  l/J),  o2  -  ln(  1  +  v2 /m2). 

Furthermore,  the  median  of  X  is  exp(p),  the  mode  is  equal  to  exp(p  a2),  and  the  a-th 
quantile  of  X  is  given  by  F ~  1  (o)  -  exp(p  +  " 1  (a) ) ,  0  <  q  <  1. 
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For  applications  in  reliability  problems,  where  X  represents  the  length  of  life  of  a 
piece  of  equipment  (or  a  component  of  a  system),  the  failure  rate  of  X  is  of  importance. 
It  is  given  by  i>(x)  =  /(x)/(  1  —  F(x)),x  >  0.  As  mentioned  earlier  in  the  introduction,  ^ 
increases  at  first  and  then  eventually  decreases  to  zero.  The  following  result  can  be  used 
to  control  the  maximum  failure  rate  in  later  applications. 

Theorem  1.  The  maximum  failure  rate  of  X  occurs  at  time  x0  =  exp(p  +  <ru/0),  where 
tv0  is  the  unique  solution  of  <p(w)/(  1  -  ft(tv))  =  tv  +  ayw  £  R.  Moreover,  the  maximum 
value  of  V  is  equal  to  ^(xo)  =  (1  +  w0/a)exp(-p  -  <rtv0). 

Proof:  By  means  of  the  substitution  x  =  txp(p  +  aw),  tv  £  R,  the  failure  rate  can  be 
written  as 

(4)  V»(x)  =  a~lexp(~p  +  a2/2)<p(w  +  a)/(  1  -  ft  (tv)),  x>0. 

Its  derivative  with  respect  to  w  is  sero  if  and  only  if  tv  satisfies  p(w)/(l  -  ft  (tv))  =  tv  +  a. 
The  ratio  (1  —  ft(tv))/p(tv)  is  called  Mill’s  ratio  and  has  been  studied  repeatedly  in  the  past 
literature.  From  one  of  the  earlier  resources,  Sampford  (1953),  one  can  see  that  the  function 
H{  tv)  =  <p(w)/(l  -  ft(tv))  is  positive  with  0  <  H'(w)  <  1  and  if  "(tv)  >  0,tv  €  R,  is  thus 
convex,  and  satisfies  lim  if  (tv)  =  0,  if  (tv)  >  tv  for  tv  >  0,  and  lim  (if  (tv)  —  tv)  =  0. 

U»— *00 

The  rest  of  the  proof  is  straightforward. 

Let  there  now  be  given  k  two-parameter  lognormal  populations  tr,  with  parameters 
(pt,o*),  or  in  view  of  (2)  and  (3),  with  (m,, v,2),  from  which  independent  samples  2Li  = 
(X,i,. . . ,  XiH)  of  a  common  size  n  >  2  have  been  drawn,  *  =  l,...,fc.  By  tising  the 
transformation  =  ln(XXJ),j  -  l,...,n,t  =  one  gets  independent  samples 

-  {Zih  •  •  • .  %in)  from  N(pi,a2),i  =  1  Selection  procedures  for  the  latter  model 

have  been  studied  extensively  in  the  literature,  where  an  overview  of  the  results  can  be 
found  in  Gupta  and  Panchapakesan  (1979).  In  the  following,  it  will  be  shown  how  various 
selection  problems  for  the  k  lognormal  populations  transform  to  equivalent  problems  for 
k  normal  populations. 

The  most  common  selection  problems  are  those  where  the  k  populations  differ  only  in 
one  single  parameter  9,  say,  and  the  goal  is  to  find  that  population  which  has  the  largest 
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value  of  0.  The  standard  approach  is  then  to  compare  xi ,...,**  through  estimators 

•  a  * 

$i  =  tf,(2C i»*  •  •  ,2Cjt),  *  =  1,. .  •  ,fc,  where  a,  is  an  estimator  of  0,,  for  the  fc  samples  of  size 
n,  which  is  optimum  in  some  reasonable  way.  Several  cases,  which  seem  quite  likely  to 
occur  in  applications,  are  discussed  below. 

Case  I:  o\  —  . . .  —  a\  =  a2 . 

Selecting  the  population  which  has  the  largest  Z%  =  (Zu  +  . . .  +  Zm)/n  is  the  natural 
procedure  for  0  =  /*.  It  has  various  optimality  properties  as  is  reported  in  Gupta  and 
Panchapakesan  (1979).  To  mention  at  least  one,  it  has  the  largest  probability  of  correctly 
selecting  the  largest  /*,,  among  all  permutation  invariant  selection  rules.  From  (2)  one  can 
see  that  it  is  also  optimum,  in  the  same  senses,  for  selecting  the  largest  mj,  i.e.  0  —  m,  or 
the  largest  v2,  i.e.  0  —  v2.  Furthermore,  the  same  holds  if  0  is  chosen  to  be  the  median,  the 
mode,  or  any  fixed  quantile  of  the  two-parameter  lognormal  family.  Finally,  from  Theorem 
1  one  concludes  that 

Corollary  1.  Under  the  assumption  of  a2  —  ...  =  o\,  the  population  with  the  largest 
(smallest!  value  of  p  has  the  smallest  (largest!  maximum  failure  rate. 

Proof:  If  the  variances  a f  are  all  equal,  then  the  solution  two.  which  is  mentioned  in 
Theorem  1,  is  seen  to  be  the  same  value  for  all  populations.  The  proof  is  thus  completed 
by  examining  the  explicit  form  of  the  maximum  failure  rate,  which  is  given  in  Theorem  1. 

Various  ways  of  implementing  this  selection  procedure  under  given  performance  cri¬ 
teria  are  described  in  Gupta  and  Panchapakesan  (1979),  which  include  two-stage  selection 
procedures  for  applications  where  the  common  value  of  a2  is  unknown. 

Case  II:  Mx  =  • .  •  =  M*  =  M- 

The  natural  procedure  for  0  =  a2  is  to  select  the  population  with  the  largest  maximum 
likelihood  estimator  a2  of  o?,t  =  1  ,...,fc,  under  both  situations  where  p  is  known  or 
unknown.  It  has  analogous  optimality  properties  as  the  procedure  in  Case  I,  and  it  can 
be  implemented  under  standard  performance  criteria.  This  time  however,  no  two-stage 
approach  is  needed  if  fi  is  unknown. 


From  (2)  it  can  be  seen  that  this  selection  rule  is  also  appropriate  for  finding  the 
largest  m* ,  i.e.  0  =  m,  or  the  largest  v2,  i.e.  0  =  v2.  Furthermore,  the  same  holds  if  0 
is  chosen  to  be  the  negative  mode  or  any  fixed  quantile  of  the  two-pax ameter  lognormal 
family.  However,  a  result  analogous  to  Corollary  1  does  not  hold.  A  thorough  examination 
of  the  results  in  Theorem  1  shows  that  the  maximum  failure  rate  is  not  a  monotone  function 
of  o2.  To  be  more  specific,  for  M(w)  =  (l  +  w/o(w))exp(—wa(w)),  where  as  in  Theorem 
1,  <t(u>)  =  H(w)  -  w,w  £  R,  one  finds  that 

(5)  o(w)2exp(wo(w))M'(w)  =  J?(tu)[l  -  (H( w)  -  w)2(l  +  tutf(u;))],  w  6  R, 

which  is  seen  to  be  positive  at  u;  =  0,  whereas  it  is  negative  for  sufficiently  small  w  since 
in  (5),  (...)  — >  — oo  as  w  — ►  — oo.  Noticing  that  H( w)  —  w  has  a  negative  derivative  on  R 
completes  the  argumentation. 

Case  III:  v2  =  . . .  =  v2  =  v2 . 

In  this  case,  one  can  see  from  (2)  and  (3)  that  the  problem  of  selecting  the  largest  /x,, 
i.e.  0  =  p,  is  equivalent  to  the  choice  of  0  —  m,  and  to  the  choice  of  0  =  a~2 .  However,  the 
estimation  of  /xj, . . .  ,pk,  and  possibly  v2  in  case  of  an  unknown  v2,  is  a  rather  difficult  task 
under  the  given  condition  of  exp(2pi  +a2)[exp(o2)  -  1]  =  ...  =  exp(2pk  +  ol)  [exp(o£)-l]. 
For  example,  the  maximum  likelihood  estimators  cannot  be  given  in  closed  form  but  rather 
have  to  be  found  through  computer  programs.  This  makes  the  given  selection  problem  less 
attractive  for  applications,  as  long  as  the  common  sample  size  n  is  small. 

For  sufficiently  large  n,  on  the  other  hand,  the  natural  procedure  is  to  select  in  terms 
of  the  largest  sample  mean  X,  =  (X^  +  ...  +  Xin)/n,i  =  1  Since  these  sample 

means  are  independent  and  asymptotically  normal  with  iV(m,,u2/n),i  =  1  ,...,fc,  one 
arrives  at  the  same  situation  which  was  considered  in  Case  I,  where  now  the  X,’s  play  the 
role  of  the  Z^’s  considered  there. 

Case  IV:  m\  ~  —  m*  =  m. 

In  this  case,  (2)  and  (3)  show  that  the  problem  of  selecting  the  largest  o2,  i.e.  0  -  a2 , 
is  equivalent  to  the  choice  of  0  —  v2 ,  and  to  the  choice  of  0  =  /x1 .  Similar  to  the  case 
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considered  before,  estimation  of  a\ , ...,of  under  the  constraint  of  /*i  +  of/2  =  ...  = 
Hk  +  o\l 2  is  difficult,  and  for  example,  the  maximum  likelihood  estimators  have  no  closed 
form.  For  sufficiently  large  n,  however,  the  natural  rule,  based  on  asymptotically  optimum 

n 

estimators,  selects  in  terms  of  the  largest  J2  {X* j  —  m) 2 , *  =  1, . .  .  ,fc,  if  m  is  known,  and 

j=i 

n 

in  case  of  an  unknown  m,  it  selects  in  terms  of  the  largest  ^2  (X,y  —  X),i  — 

__  j=i 

where  X  is  the  overall  average  of  the  kn  observations.  Thus,  one  arrives  at  the  situation 

considered  before  in  Case  II. 

The  General  Case 

Some  comments  have  to  be  made  at  the  end  of  this  section  about  situations  where 
both  Hi  and  of,  *  =  1, . . . , k  vary  freely.  Although  selection  with  respect  to  one  parameter 
0  =  /i(/x,o2),  say,  without  control  of  the  complementing  second  parameter  dimension,  is 
usually  not  acceptable  in  applications,  circumstances  may  arise  where  this  type  of  problem 
is  of  relevance.  To  give  two  examples  from  reliability  considerations,  6  =  /i  +  o$_1(a) 
corresponds  to  selection  of  the  largest  a-th  quantile  for  some  fixed  0  <  a  <  1,  and  0  = 
(li  —  ln(t0))  fa  refers  to  finding  the  largest  probability  of  surviving  a  fixed  time  t0  >  0.  Such 
problems  can  be  treated,  through  the  transformed  variables  ,  within  the  framework  of 
k  independent  samples  from  =  1  ,...,&,  and  they  deserve  to  be  studied  more 

carefully  in  the  future. 

3.  Three-Parameter  Lognormal  Populations. 

Let  us  assume  that  there  are  k  lognormal  populations  n ,•  with  parameters  (7,,  //,, 
of),  *  =  l,...,k,  from  which  independent  samples  Xj=(X.i.  .  ..,Jftn)  of  a  common  size 
of  n  >  2  have  been  drawn,  i  =  1  ,...,k.  The  problem  to  be  considered  is  to  select  that 
population  which  has  the  largest  “guaranteed  lifelength”  7,,  i.e.  0  —  7.  Throughout  this 
section,  we  assume  that  Hi  =  •  •  •  =  Hk  =  M»  say,  and  o2  =  . . .  =  o£  =  o2,  say,  hold,  where 
in  the  first  part,  h  and  o7  are  known.  This  situation  is  of  statistical  relevance,  and  it  is 
complex  enough  to  develop  the  main  ideas  of  the  approach  completely.  Later  it  will  be 
seen  clearly  which  adjustments  have  to  be  made  to  adapt  the  derived  selection  rule  to  the 


8 


case  where  n  and  a 2  are  unknown.  The  derived  selection  rule  will  actually  be  independent 
of  /i,  and  for  o2,  a  pooled-sample  estimator  will  be  found. 

The  standard  approach  to  this  problem  has  been  described  and  applied  in  Section 
2.  For  i  =  l,...,fc,  a  suitable  estimator  0,  =  0,(Xi>  •  •  •  >2Gc)>  based  on  the  k  samples 
of  size  n,  with  good  performance  properties  has  to  be  found,  and  then  the  selection  is 
made  in  terms  of  the  largest  of  the  estimates  . . .  ,0*.  However,  as  mentioned  already  in 
the  Introduction,  estimation  of  parameters  is  in  this  case  rather  difficult,  cf.  Cohen  and 
Whitten  (1980),  Johnson  and  Kotz  (1970),  Kane  (1982),  and  LaRiccia  and  Kindermann 
(1983).  This  is  so  even  if  a2  is  known,  as  it  can  be  seen  from  Gibbons  and  McDonald 
(1975),  where  a2  =  1  is  assumed. 

In  ranking  and  selection  problems  for  location  parameter  families  with  nonsymmetric 
densities,  one  realizes  quickly  that  comparing  the  k  estimates  of  a  “natural”  point  of 
location  in  these  k  densities  may  not  be  the  best  approach.  Rather  than  estimating  in  the 
present  model  7i,. . .  ,7 jt ,  or  the  k  expectations,  medians,  modes,  etc.,  and  then  selecting 
in  terms  of  the  largest  of  the  k  estimates,  one  may  better  search  for  any  functional  of  the 
distribution  which  can  be  estimated  sufficiently  well  by  a  location  equivariant  statistic. 

In  the  following,  we  will  consider  the  class  of  L-estimators,  which  are  linear  com¬ 
binations  of  the  order  statistics  of  a  sample.  To  simplify  explanations  and  notation, 
let  us  first  consider  one  sample  from  a  lognormal  population,  X1,...,Xn,  say,  where 
ln(Xj  -  6)  ~  N(n,o2),j  =  l,...,n,  and  n  €  R  and  a2  >  0  are  known.  An  L-estimator  is 
now  of  the  form 

n 

(6)  = 

;  =  i 

where  X[ij  <  . . .  <  X|n|  denote  the  ordered  values  of  the  sample  Xi , . . . ,  Xn,  and  where  A 
is  a  bounded  weight  function,  which  is  defined  on  the  open  unit  interval  (0,1),  and  satisfies 

i 

/  A (t)dt  —  1.  It  should  be  pointed  out,  however,  that  A  is  allowed  to  assume  also  negative 
0 

values.  A  thorough  analysis  and  discussion  of  this  class  of  estimators  can  be  found  in 
Huber  (1981),  and  in  Lehmann  (1983),  Chapter  5.5. 

The  main  application  of  L-estimators  is  for  location  parameter  families  /  =  {fe}ee R> 


where  fe(x)  =  f(x  —  0),x  €  R,  and  where  /  is  symmetric  about  the  origin.  Then  for  any  A 
which  is  symmetric  about  1/2,  Ln  as  given  by  (6)  is  a  suitable  candidate  for  an  estimator 
of  0.  As  shown  in  Lehmann  (1983),  if  one  takes  weights  tuJn  proportional  to  A [j/(n  +  1)), 
which  add  up  to  one,  then 

n 

(7)  Ln  —  WjnX[j] 

;=i 

is  a  location  equivariant  estimator  of  0,  which  is  asymptotically  normal  and  unbiased, 
provided  that  some  mild  regularity  conditions  on  /  and  A  are  fulfilled,  and  the  asymptotic 
distribution  of  Ln  is  the  same  as  that  one  of  Ln.  The  last  step  in  this  approach  is  to  find 
a  weight  function  A  which  yields  the  minimum  asymptotic  variance. 

To  apply  this  technique  to  the  lognormal  model  under  concern,  one  notices  the  follow¬ 
ing  facts.  Ln  and  Ln  have  still  a  common  asymptotic  normal  distribution.  Theorem  5.1  of 
Lehmann  (1983)  is  formulated  and  proved  without  the  assumption  of  symmetry  of  A  and 
/,  and  the  asymptotic  mean  and  variance  are  given  explicitly  in  dependence  of  A  and  /. 
The  main  idea  for  solving  the  k  lognormal  selection  problem  is  now  as  follows.  Since  Ln 
is  location  equivariant,  it  is  reasonable  to  find  its  value  for  each  of  the  k  samples  and  then 
select  that  populations  which  yields  the  largest  Ln-value.  And  one  realizes  immediately 
that  this  selection  procedure  remains  exactly  the  same  if  Ln  instead  of  Ln  is  employed. 
To  find  now  a  good  selection  procedure,  one  has  to  search  for  a  suitable  weight  function 
A,  which  guarantees  that  Ln  has  a  small  variance  or  asymptotic  variance.  In  this  paper, 
the  latter  approach  will  be  followed. 

At  this  point  one  can  see  that  selection  of  populations  is  not  always  a  comparison 
of  estimates  of  the  location  of  some  natural  point  of  shape  in  the  densities.  Varying  A 
to  minimize  the  asymptotic  variance  of  Ln  makes  no  sense  from  an  estimation  point  of 
view,  since  for  a  nonsymmetric  /,  the  asymptotic  mean  varies  substantially  with  A.  From 
Lehmann  (1983)  one  finds  that  it  is  equal  to 

(8)  u(F, A)  =  [  A {u)F~l{u)du, 

Jo 

where  F  is  the  cumulative  distribution  function  of  /.  But  it  does  make  sense  from  a  selec¬ 
tion  point  of  view,  since  comparisons  of  0i  s  is  equivalent  to  comparisons  of  the  locations 


of  any  fixed  point  of  the  k  densities.  In  other  words,  if  location  equivariant  statistics  are 
employed,  it  does  not  matter  what  they  estimate  as  long  as  they  are  doing  it  well. 


To  minimize  now  the  asymptotic  variance  r2(F,  A),  say,  of  Lnt  Theorem  5.2  of 
Lehmann  (1983)  provides  the  solution.  A  careful  examination  of  its  proof  shows  that, 
with  some  minor  modifications,  it  can  be  used  as  the  proof  of  a  generalized  version  of 
this  theorem,  which  does  not  require  symmetry  of  A  and  /,  and  can  be  applied  to  the 
two-parameter  lognormal  density 


(9) 


f(x)  =  (ax)  1<p(o  1  (Inx  -  fj.))  ,  x  >  0. 


The  construction  of  the  weight  function  Ao,  say,  which  minimizes  t2[F,  A),  proceeds  as 
follows.  It  involves  an  auxiliary  function  g,  which  is  denoted  by  7  in  Lehmann  (1983). 


(10)  g(x)  =  -f'[x)/f[x)  =  x  [l+<7  [Inx  -//)],  x  >  0, 

which  has  the  derivative 


Theorem  2.  For  a  sample  of  size  n  from  a  distribution,  which  has  the  density  given  by 
(9) ,  tfie  statistic  Ln  (or  Ln)  which  has  the  smallest  asymptotic  variance  r2  ( F ,  A)  is  based 
on  the  weight  function 

(15)  A0(t)  =  (1  +  a2)-1(l  -  o[a  +  $-1(t)]) 

•  exp(-2a[a  +  $-1(t)]),  te  (0,1). 

Moreover,  the  asymptotic  variance  and  mean  are  given  by 

(16)  t2(F,  Ao)  =  a2(l  +  a2)~1exp( 2{p  —  a2)),  and 

(17)  u(F,  A0)  =  (1  +  a2)~1exp(p  -  3o2/2). 

It  should  be  noted  that  (17)  has  been  stated  only  for  the  sake  of  completeness.  It  is 
irrelevant  for  applications  in  the  selection  problem  under  concern.  It  follows  from  (8),  i.e. 

(18)  u(F,X0)=  fl  Xo{t)F-l{t)dt. 

Jo 

More  importantly,  one  can  see  from  (15)  that  A0  does  not  depend  on  p.  Thus,  it  does 
not  need  to  be  known  in  applications. 

Now  we  return  to  the  original  selection  problem  for  k  three-parameter  lognormal 
populations,  where  the  observations  Af,y  satisfy  ln(Xij  —  7 i)  ~  N(p,o2),j  =  1, . . . ,  n,  1  = 
1  and  are  independent.  The  goal  is  to  select  the  population  associated  with  the 

largest  7-value,  i.e.  0  =  7.  It  is  still  assumed  that  p  and  a2  are  known.  The  case  where 
they  are  unknown  will  be  discussed  later. 

The  selection  procedure  based  on  L-estimation  is  to  select  that  population  which  yields 
the  largest  Ln-statistic,  based  on  Ao  given  by  (15),  from  the  k  samples.  This  is  justified  by 
recalling  that  the  limiting  distribution  as  n  — ♦  00  of  nl^2\Ln{X_i)  —  li  —  ^(i7’,  A0)]/t(jP,  A0) 
is  N( 0,  l),t  =  1, . . .  ,k.  Thus,  for  sufficiently  large  n,  the  probability  of  correctly  selecting 
the  population  with  the  largest  7-value  is  approximated  by,  say,  P( Ao)  which  is  simply  the 
following. 

/oo  *-1 

n  $  (^  +  »I/2(7[fc]  -  7[tj)A(P,A0)J  <p{z)dz. 

-°°  .=1 


(19) 


It  is  the  well-known  PCS  for  selecting  the  largest  normal  mean,  which  is  tabulated  for 
selected  values  of  k  and  A  =  A0),  cf.  Gupta  and  Panchapakesan 

(1979). 

Any  other  selection  rule  which  employs  Ln-statistics  derived  from  a  different  weight 
function  A  has  analogous  properties.  Everything  mentioned  above  holds  analogously,  A0 
has  simply  to  be  replaced  by  A.  And  since  t(F,  Ao)  <  t(F,  A),  we  see  that  the  selection 
rule  based  on  A0  is  asymptotically  superior  to  all  others  based  on  different  A’s.  To  be 
more  specific,  we  consider  the  concept  of  Pitman’s  asymptotic  relative  efficiency  (ARE) 
for  selection  rules,  which  is  described  in  details  in  Miescke  (1979).  For  two  L-statistics 
and  Li?\  say,  which  are  based  on  Ai  and  A2,  respectively,  one  finds  that 

(20)  ARE  {LW'LM)  =  r2(E,A2)/r2(E,A,). 

To  give  two  examples,  let  L*  be  the  sample  median,  and  let  L**  be  the  sample  mean. 
Then  for  Ln,  based  on  A0,  one  gets  the  following  asymptotic  relative  efficiencies,  which  are 
greater  than  one. 

(21)  ARE  (L,  L*)  —  2-17r(l  +  o2)eip(2cr2),  and 
ARE  (L,  L**)  =  (1  +  <7'2)ezp(3o2)[eip(tT2)  -  lj. 

Thus,  for  the  special  case  of  a2  =  1,  one  finds  that  ARE  (L,L*)  =  23.2.  This  means  that 
asymptotically,  the  selection  rule  based  on  the  k  medians  requires  about  22.2  times  more 
observations  than  the  rule  based  on  Ln,  to  get  the  same  probability  of  a  correct  selection. 
The  result  for  the  sample  means  procedure  is  even  more  impressive.  Here  one  finds  ARE 
(L,L**)  =  69.0. 

To  get  an  idea  how  well  this  selection  procedure  based  on  Ln  through  Ao  performs  at 
an  intermediate  sample  size,  the  following  example  is  considered. 

Example:  n  =  20,  u  =  0.  <7 2  =  1. 


From  the  tables  in  Gupta,  McDonald,  and  Galarneau  (1974),  the  covariance  matrix 
C,  say,  of  the  order  statistics  <  . . .  <  X[n]  of  a  sample  of  size  n  =  20  from  a  lognormal 


distribution  with  n  =  0  and  o2  —  1  can  be  derived.  This  allows  to  determine,  with 
sufficient  accuracy,  the  variance  of  any  linear  combination  of  these  order  statistics,  and 
thus  for  any  Ln  based  on  weights  u>jn,j  =  1, ...  ,n,  as  it  is  shown  in  (7),  by  means  of 

(22)  Var  (Zn)  =  Var  (u>TY)  =  wTCw, 

where  Y_  —  ( , . . . ,  Xjnj)7*,  and  tvT  denotes  the  transposed  of  the  column  vector  w  of 
weights. 

Since  n  is  not  large,  it  is  more  appropriate  to  consider  Ln,  given  by  (7),  since  it  is 
location  equivariant  and  thus  more  suitable  for  comparison  purposes. 

The  values  of  A0(j /21) , j  =  1,...,20,  can  be  found  by  use  of  (15),  where  o2  -  1, 
and  the  values  of  ♦  ~1(j/21),j  =  1, ...  ,20,  which  are  tabulated  in  Mueller,  Neumann,  and 
Storm  (1977),  p.  113.  From  here,  it  is  easy  to  determine  the  weights  w}2o,j  =  1,. . .  ,20, 
which  are  proportional  to  Ao(j/21),j  =  1,..  .,20,  and  add  up  to  one.  The  values  of  the 
weights  w}2 o  of  Xy\,j  =  1,...,20,  have  been  found  to  be  the  following:  .55690,  .21305, 
.10716,  .05996,  .03514,  .02083,  .01209,  .00659,  .00306,  .00080,  -.00063,  -.00149,  -.00196, 
-.00216,  -.00217,  -.00203,  -.00180,  -.00150,  -.00113,  -.00070. 

The  variance  of  L20  is  shown  below,  together  with  the  variances  of  three  other  location 
equivariant  statistics,  which  are  natural  candidates  for  selection  purposes. 

L20  XW  (X„0,  +  •A’ju])/2  X 

.0076  .0076  .0826  .2335 

From  this  one  can  see  that  the  asymptotically  most  efficient  selection  rule  based  on  Lj0 
is  already  at  n  =  20,  much  better  than  the  ones  based  on  sample  medians  and  on  sample 
means.  But,  strikingly,  the  smallest  order  statistic  provides  a  reasonable  alternative.  Thus, 
although  it  is  not  acceptable  as  an  estimator  of  the  “guaranteed  lifetime"  it  turns  out  to 
be  a  very  suitable  tool  for  ranking  and  selection  of  71 , . . . ,  nr*.  This  demonstrates  again  that 
selection  of  location  parameters  for  non-symmetric  densities  is  not  always  just  a  selection 
in  terms  of  estimates  of  a  natural  point  of  the  densities. 


To  complete  this  example,  let  us  mention  that  the  variance  of  Li o,  approximated  by 
A0)/‘20  through  (16)  with  p  -  0  and  a2  -  1,  turns  out  to  be  equal  to  .0034. 

At  the  end  of  this  section,  let  us  deal  with  the  more  general  situation  where  p  and  o2 
are  unknown.  Lack  of  knowing  p  does  not  cause  any  trouble,  since  Ao(f)  does  not  depend 
on  p.  This  can  be  seen  immediately  from  (15).  Moreover,  it  should  be  noted  that  the  two 
specific  asymptotic  relative  efficiencies,  which  are  given  by  (21),  do  not  depend  on  p.  This 
follows  from  the  fact  that  p  contributes  only  a  factor  of  exp( 2p)  to  the  asymptotic  variances 
of  linear  combinations  of  the  order  statistics  of  a  sample  from  a  lognormal  distribution. 

Now,  estimating  a 2  is  not  such  a  difficult  task  as,  for  example,  estimating  h,  ■ .  ■  ,~tk- 
The  moment  estimator  for  one  sample,  as  it  is  given  by  Johnson  and  Kotz  (1970),  p.  124, 
can  be  modified  to  a  pooled-sample  estimator  for  the  k  samples  under  concern  as  follows. 
Let 

k  n 

(23)  -  (kn)  1  £  ^(XtJ  X,)\  g  =  2,3, 

.-i  j - i 

where  X\,. . .  ,Xk  are  the  usual  sample  means.  Since  m2  estimates  in  (2),  v 2  —  exp 
(2p  »  o2)jeip(o2)  lj,  and  m3  estimates  the  third  central  moment,  which  is  the  same  for 
all  of  the  k  populations,  and  is  for  all  t,j  equal  to 

( 24 )  K(\X,}  £(X,;)j3) 

exp(2p  +  Zo2 l2)\exp[o2)  -  1  \2\exp(a2)  +  2], 

it  follciws  that  (m3)2(mj)  3  is  a  suitable  estimator  of  \exp(o2)  l] \exp(o2)  +  2]2.  The 
proposed  estimator  of  a2  is  thus  the  unique  solution  a2  of 

(25)  | exp{a2)  \\\exp{a2)  t  2j2  -  (m3)2(m2)  3. 

Although  one  might  have  some  reservations  about  the  quality  of  this  estimator  for  one 
sample  of  size  n,  in  the  present  situation,  where  usually  k  *  3  samples  contribute  to  d2, 
this  estimator  is  expected  to  perform  sufficiently  well  to  be  used  in  the  selection  procedure 
under  concern. 
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